Quasi-automatic extraction of tongue movement from a large existing speech cineradiographic database
نویسندگان
چکیده
Automatic analysis of tongue movement in large existing cineradiographic databases can provide valuable information to understood speech production. We describe here a method for semi-automatic extraction of articulatory information from video observation in order to derive quasi-automatically a geometrical parameterization of the vocal tract movements. The algorithm starts with a limited manual processing step consisting in marking 10 points (12 degrees of freedom) on 100 chosen key images. The treatment on the whole sequence is then automatic thanks to a retro-marking method. At first, the whole database is indexed via a similarity measure performed with the key images. Then, we associate on the original images the geometrical information recovered on the key images via this indexing. Different complementary error reduction methods are also proposed. Averaging geometrical configurations of a neighborhood, temporal filtering and spline interpolation allow to reduce the reconstruction error to about 10 pixels for a tongue contour of average length of 260 pixels.
منابع مشابه
Quasi-automatic extraction of tong large existing speech cineradiog
Automatic analysis of tongue movement in large existing cineradiographic databases can provide valuable information to understood speech production. We describe here a method for semi-automatic extraction of articulatory information from video observation in order to derive quasi-automatically a geometrical parameterization of the vocal tract movements. The algorithm starts with a limited manua...
متن کاملA semi-automatic method for extracting vocal tract movements from X-ray films
Despite the development of new imaging techniques, existing X-ray data remain an appropriate tool to study speech production phenomena. However, to exploit these images, the shapes of the vocal tract articulators must first be extracted. This task, usually manually realized, is long and laborious. This paper describes a semi-automatic technique for facilitating the extraction of vocal tract con...
متن کاملSemi-automatic extraction of vocal tract movements from cineradiographic data
Since high speed X-ray films still provide the best dynamic view of the entire vocal tract, large existing databases have been preserved and are available for the speech research community. We propose a new technique for facilitating the extraction of the vocal tract shape and the movements of the articulators from complete sequences of these databases. The method was first developed for the ex...
متن کاملArticulatory synthesis driven by geometrical contours of the vocal tract extracted from cineradiographic data
Recently, we have proposed a new technique for facilitating the extraction of vocal tract contours from complete sequences of large existing cineradiographic databases. The articulators (tongue, tongue tip, velum, lips, etc.) are processed independently before being combined to reconstruct the whole vocal tract. Applied to one sequence of the ATR database, Laval43, the method allows us to estim...
متن کاملA Database for Automatic Persian Speech Emotion Recognition: Collection, Processing and Evaluation
Abstract Recent developments in robotics automation have motivated researchers to improve the efficiency of interactive systems by making a natural man-machine interaction. Since speech is the most popular method of communication, recognizing human emotions from speech signal becomes a challenging research topic known as Speech Emotion Recognition (SER). In this study, we propose a Persian em...
متن کامل